rank | frequency | n-gram |
---|---|---|
1 | 8221 | -ा |
2 | 7935 | -ी |
3 | 5456 | -र |
4 | 5349 | -ं |
5 | 4319 | -न |
rank | frequency | n-gram |
---|---|---|
1 | 3235 | -ों |
2 | 1242 | -री |
3 | 1161 | -या |
4 | 1097 | -्स |
5 | 1017 | -ता |
rank | frequency | n-gram |
---|---|---|
1 | 879 | -यों |
2 | 702 | -िया |
3 | 505 | -रों |
4 | 498 | -िंग |
5 | 365 | -ारी |
rank | frequency | n-gram |
---|---|---|
1 | 714 | -ियों |
2 | 304 | -ियां |
3 | 166 | -रिया |
4 | 156 | -ारों |
5 | 135 | -सिंह |
rank | frequency | n-gram |
---|---|---|
1 | 143 | -रियों |
2 | 91 | -ेंगे। |
3 | 86 | -तियों |
4 | 60 | -पूर्ण |
5 | 56 | -स्तान |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings